Image retrieval system and method

ABSTRACT

An image retrieval method applies an application server, one or more calculating servers, and a sorting server to perform image retrieval. The application server extracts visual features of an exemplary image. The one or more calculating servers calculate similarities of available images according to the visual features of the exemplary image. The sorting server sorts the available images according to the similarities so as to obtain images similar to the exemplary image.

CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation application of U.S. application Ser. No. 12/635,849, filed on Dec. 11, 2009.

BACKGROUND

1. Technical Field

Embodiments of the present disclosure relate to information retrieval, and particularly to an image retrieval system and method.

2. Description of Related Art

With the rapid development of multimedia technology and computer networks, digital images are widespread. Presently, popular search engines, such as Google, Yahoo, and MSN, provide an image retrieval function for searching images of interest. Most image retrievals add metadata, such as captions, keywords and/or descriptions to the images, and search the images according to the metadata. Such image retrieval is referred to as text-based image retrieval. However, if images similar to an exemplary image are required to be found from an image database, the text-based image retrieval cannot fulfill the image search task. In addition, most image related applications require much computation. Therefore, it may take a long time to implement the image retrieval.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of one embodiment of an image retrieval system.

FIG. 2 is a block diagram of one embodiment of an image retrieving unit of an application server in FIG. 1 comprising function modules.

FIG. 3 is a flowchart of one embodiment of an image retrieval method.

DETAILED DESCRIPTION

All of the processes described below may be embodied in, and fully automated via, functional code modules executed by one or more general purpose computers or processors. The code modules may be stored in any type of computer-readable medium or other computer storage device. Some or all of the methods may alternatively be embodied in specialized computer hardware.

FIG. 1 is a block diagram of one embodiment of an image retrieval system 10. The image retrieval system 10 may be used to quickly search images similar to an exemplary image from available images. The image retrieval system 10 may include at least one client computer 11 (only one shown in FIG. 1), an application server 12, calculating servers 13A-13C, a sorting server 14, an index server 15, and an image server 16. The application server 12 is connected to the client computer 11, the calculating servers 13A-13C, the sorting server 14, and the image server 16. The calculating servers 13A-13C are further connected to the index server 15. The client computer 11 may be connected to a display screen 17.

The image server 16 may include an image database 160 that stores the available images. The index server 15 may include an image index database 150 that stores image indexes of the available images. The available images can be fetched from the image database 160 according to image indexes of the available images. In one embodiment, the image indexes include image locations in the image server 16, metadata, and visual features of the available images. The metadata may include captions, keywords, and/or descriptions of the available images. The visual features may include color features and shape features. In one example, the available images are patent images. Metadata of each patent image may include a patent name, an inventor name, a patent number, an application number, an application date, an issue date, and an international classification. Format of the images may include, but are not limited to PDF, JPG, GIF, TIFF, for example.

The client computer 11 provides a user interface to receive an exemplary image and transfer the exemplary image to the application server 12. Furthermore, the client computer 11 receives a search result from the application server 12 and displays the search result on the display screen 17.

The application server 12 may include a storage system 120, at least one processor 121, and an image retrieving unit 122. One or more computerized codes of the image retrieving unit 122 is stored in the storage system 12 and executed by the at least one processor 13. In one embodiment with respect to FIG. 2, the image retrieving unit 122 includes an extracting module 200, a distributing module 210, a gathering module 220, a sorting module 130, and an outputting module 240.

The extracting module 200 is operable to extract visual features of the exemplary image and transfers the visual features to the calculating servers 13A-13C. The visual features of the exemplary image may include color features, such as an arithmetic mean of pixel values of the exemplary image. The visual features of the exemplary image may further include shape features, such as an outline of the exemplary image.

The distributing module 210 is operable to allocate image comparison tasks to the calculating servers 13A-13C. In one embodiment, the distributing module 210 equally allocates the image comparison tasks. For example, each of the calculating servers 13A-13C is allocated an image comparison task of 1000 exclusive available images. The calculating servers 13A-13C calculate a similarity of each of the allocated available images according to the visual features of the exemplary image and the visual features of the allocated available image. The calculating servers 13A-13C fetch image indexes of each of the allocated available images from the index server 15. Additionally, the calculating servers 13A-13C transfer the similarities and the image indexes of the allocated available images to the application server 12. A similarity of an available image indicates a similar degree between the available image and the exemplary image.

The gathering module 220 is operable to gather the similarities and the image indexes of all the available images from the calculating servers 13A-13C.

The sorting module 230 is operable to transfer the gathered similarities of the available images to the sorting server 14, and send a sorting command to the sorting server 14. In response to the sorting command, the sorting server 14 sorts the available images according to the gathered similarities of the available images, and returns a sorting sequence of the available images to the application server 12.

The outputting module 240 is operable to receive the sorting sequence of the available images from the sorting server 14, and output the image indexes of the available images in the sorting sequence to the client computer 11. In one embodiment, the outputting module 240 outputs a specified amount of the image indexes of the available images to the client computer 11.

FIG. 3 is a flowchart of one embodiment of an image retrieval method. The image retrieval method can quickly search images similar to an exemplary image. Depending on the embodiments, additional blocks may be added, others removed, and the ordering of the blocks may be changed.

In block S301, the client computer 11 receives the exemplary image and transfers the exemplary image to the application server 12. In one embodiment, the client computer 11 provides a user interface to receive the exemplary image.

In block S302, the extracting module 200 extracts visual features of the exemplary image and transfers the visual features to the calculating servers 13A-13C. The visual features of the exemplary image may include color features and shape features. In one example, the extracting module 200 extracts color features of the exemplary image if a color search mode is specified. In another example, the extracting module 200 extracts shape features if a shape search mode is specified.

In one embodiment, the extracting module 200 divides the exemplary image into a plurality of image blocks and extract visual features of the image blocks. Overall visual features of the exemplary image may be determined according to the visual features of the image blocks. In one example, the exemplary image is a gray image and equally divided into 256 image blocks. The extracting module 200 may calculate an arithmetic mean of gray values of pixels in each image block as color features of the image block. The extracting module 200 integrates the color features of the 256 image blocks to obtain overall color features of the gray exemplary image. In another example, the extracting module 200 may transform the gray exemplary image into a black-and-white exemplary image and divide the black-and-white exemplary image into 256 image blocks. A ratio of a black pixel amount to a total pixel amount in the image block may be calculated as shape features of the image block. The extracting module 200 integrates the shape features of the 256 image blocks to obtain overall shape features of the gray exemplary image. If the exemplary image is a color image, the extracting module 200 may transform the color exemplary image into a gray image and extract visual features from the gray image.

It may be understood that the visual features of the available images may be predetermined using a same method as the exemplary image. In one embodiment, each of the available images is divided into a same amount of image blocks as the exemplary image. The visual features of the available image are obtained by integrating visual features of each image block of the available image.

In block S303, the distributing module 210 allocates image comparison tasks of the available images to the calculating servers 13A-13C. In one embodiment, the distributing module 210 equally allocates the image comparison tasks. In one example, there are 3000 available images. The distributing module 210 may allocate an image comparison task of 1000 exclusive available images to each of the calculating servers 13A-13C.

In block S304, the calculating servers 13A-13C calculate a similarity of each of the allocated available images according to the visual features of the exemplary image and the visual features of the allocated available image. Furthermore, the calculating servers 13A-13C fetch image indexes of each of the allocated available images from the index server 15, and transfer the similarity and the image indexes of the allocated available images to the application server 12. The image indexes of the available images may include image locations, visual features, and metadata.

As mentioned above, the visual features of the exemplary image and the available images may be divided into a same amount of image blocks. The calculating servers 13A-13C may calculate a similarity of each image block of an available image, and calculates an overall similarity of the available image according to the similarities of the image blocks. In one example, the calculating module 13A-13C may calculate an arithmetic mean of the similarities of the image blocks as the overall similarity of the available image.

The similarity of an image block of the available image may be determined according to a difference between the image block of the image and a corresponding image block of the exemplary image. For example, gray values of the exemplary image and an available image are 0-255, an arithmetic mean of an image block of the available image is 100, and an arithmetic mean of a corresponding image block of the exemplary image is 110. Thus, a similarity of the image block of the available image may be determined as |110−100|/255.

In block S305, the gathering module 220 gathers the similarities and the image indexes of all the available images from the calculating servers 13A-13C. In one example, each of the calculating servers 13A-13C performs an image comparison task of 1000 available images, and returns similarities and image indexes of the 1000 available images. The gathering module 220 gathers all the similarities and image indexes, so as to obtain similarities and image indexes of the 3000 available images.

In block S306, the sorting module 230 transfers the gathered similarities of the available images to the sorting server 14, and sends a sorting command to the sorting server 14.

In block S307, the sorting server 14 sorts the available images according to the similarity of each of the available images, and returns a sorting sequence of the available images to the application server 12. The sorting server 14 may sort the available images using a sorting algorithm, such as bubble sort, insertion sort, and merge sort. In one example, the sorting server 14 uses a bubble sort algorithm to sort the 3000 available images in an ascending order according to the similarities.

In block S308, the outputting module 240 receives the sorting sequence of the available images from the sorting server 14. The outputting module 240 outputs the image indexes of the available images in the sorting sequence to the client computer 11. In one embodiment, the outputting module 240 outputs a specified amount of image indexes of the available images to the client computer 11. For example, the outputting module 230 sends top 100 image indexes of the available images to the client computer 11.

In block S309, the client computer 11 displays the image indexes of the available images in the sorting sequence on the display screen 17. Therefore, an available image in a former order may be more similar to the exemplary image. In one embodiment, if an available image is user-selected, the application server may fetch the available image from the image database 160.

Although certain inventive embodiments of the present disclosure have been specifically described, the present disclosure is not to be construed as being limited thereto. Various changes or modifications may be made to the present disclosure without departing from the scope and spirit of the present disclosure. 

1. An image retrieval system for searching images similar to an exemplary image from available images, the image retrieval system comprising: a storage system; at least one processor; an image retrieval unit comprising one or more computerized codes that are stored in the storage system and executed by the at least one processor, the one or more computerized codes comprising: an extracting module operable to extract visual features of the exemplary image and transfer the visual features of the exemplary image to one or more calculating servers, wherein the visual features of the exemplary image comprise color features and shape features, and wherein the visual features of the exemplary image are extracted by dividing the exemplary image into a plurality of image blocks, calculating an arithmetic mean of gray values of pixels in each image block as color features of the image block, calculating a ratio of black pixels to total pixels in the image block as shape features of the image block, and determining overall visual features of the exemplary image according to the color features and the shape features of the image blocks; a distributing module operable to allocate image comparison tasks of the available images to the one or more calculating servers, such that the one or more calculating servers calculate similarities of the available images to the exemplary image according to the visual features of the exemplary image and predetermined visual features of the available images, fetch image indexes of the available images from an index server, and return the similarities and the image indexes of the available images to the image retrieval system; a gathering module operable to gather the similarities and the image indexes of all the available images from the one or more calculating servers; a sorting module operable to transfer the gathered similarities of the available images to a sorting server, such that the sorting server sorts the available images according to the gathered similarities of the available images; and an outputting module operable to receive a sorting sequence of the available images from the sorting server, and output the image indexes of the available images in the sorting sequence to a client computer.
 2. The image retrieval system of claim 1, wherein each of the available images is divided into a same amount of image blocks as the exemplary image, a similarity of each image block of an available image is calculated, and an overall similarity of the available image to the exemplary image is calculated according to the similarities of all the image blocks of the available image.
 3. The image retrieval system of claim 2, wherein an arithmetic mean of the similarities of all the image blocks of the available image is calculated as the overall similarity of the available image to the exemplary image.
 4. The image retrieval system of claim 2, wherein the similarity of an image block of the available image is determined according to a difference between the image block of the available image and a corresponding image block of the exemplary image.
 5. The image retrieval system of claim 1, wherein the image indexes of the available images comprise image locations, metadata, and visual features of the available images.
 6. The image retrieval system of claim 5, wherein the metadata of the available images comprise captions, keywords, and/or descriptions of the available images.
 7. The image retrieval system of claim 1, wherein the exemplary image and the available images are in portable document format (PDF), joint photographic experts group (JPEG), graphic interchange format (GIF), or tagged image file format (TIFF).
 8. The image retrieval system of claim 1, wherein the image comparison tasks are equally allocated to the calculating servers.
 9. The image retrieval system of claim 1, wherein a specified amount of the image indexes of the available images is output.
 10. A computer-based image retrieval method for searching images similar to an exemplary image from available images, the image retrieval method being executed by a processor of a computing device and comprising: extracting visual features of the exemplary image and transferring the visual features of the exemplary image to one or more calculating servers, wherein the visual features of the exemplary image comprise color features, and the color features of the exemplary image are extracted by dividing the exemplary image into a plurality of image blocks, calculating an arithmetic mean of gray values of pixels in each image block as color features of the image block, and determining overall color features of the exemplary image according to the color features of the image blocks; distributing image comparison tasks of the available images to the one or more calculating servers, such that the one or more calculating servers calculate similarities of the available images to the exemplary image according to the visual features of the exemplary image and predetermined visual features of the available images, fetch image indexes of the available images from an index server, and returns the similarities and the image indexes of the available images; gathering the similarities and the image indexes of all the available images from the one or more calculating servers; transferring the gathered similarities of the available images to a sorting server, and causing the sorting server to sort the available images according to the gathered similarities of the available images; and receiving a sorting sequence of the available images from the sorting server, and outputting the image indexes of the available images in the sorting sequence to a client computer.
 11. The image retrieval method of claim 10, wherein each of the available images is divided into a same amount of image blocks as the exemplary image, and a similarity of one available image is calculated by calculating a similarity of each image block of the one available image, and calculating an overall similarity of the one available image according to the similarities of all the image blocks of the one available image.
 12. The image retrieval method of claim 11, wherein an arithmetic mean of the similarities of all the image blocks of the available image is calculated as the overall similarity of the available image to the exemplary image.
 13. The image retrieval method of claim 11, wherein the similarity of an image block of the available image is determined according to a difference between the image block of the available image and a corresponding image block of the exemplary image.
 14. A computer-based image retrieval method for searching images similar to an exemplary image from available images, the image retrieval method being executed by a processor of a computing device and comprising: extracting visual features of the exemplary image and transferring the visual features of the exemplary image to one or more calculating servers, wherein the visual features of the exemplary image comprise shape features, and the shape features of the exemplary image are extracted by dividing the exemplary image into a plurality of image blocks, calculating a ratio of black pixels to total pixels in the image block as shape features of the image block, and determining overall shape features of the exemplary image according to the shape features of the image blocks; distributing image comparison tasks of the available images to the one or more calculating servers, such that the one or more calculating servers calculate similarities of the available images to the exemplary image according to the visual features of the exemplary image and predetermined visual features of the available images, fetch image indexes of the available images from an index server, and returns the similarities and the image indexes of the available images; gathering the similarities and the image indexes of all the available images from the one or more calculating servers; transferring the gathered similarities of the available images to a sorting server, and causing the sorting server to sort the available images according to the gathered similarities of the available images; and receiving a sorting sequence of the available images from the sorting server, and outputting the image indexes of the available images in the sorting sequence to a client computer.
 15. The image retrieval method of claim 14, wherein each of the available images is divided into a same amount of image blocks as the exemplary image, and a similarity of one available image is calculated by calculating a similarity of each image block of the one available image, and calculating an overall similarity of the one available image according to the similarities of all the image blocks of the one available image.
 16. The image retrieval method of claim 15, wherein an arithmetic mean of the similarities of all the image blocks of the available image is calculated as the overall similarity of the available image to the exemplary image.
 17. The image retrieval method of claim 15, wherein the similarity of an image block of the available image is determined according to a difference between the image block of the available image and a corresponding image block of the exemplary image.
 18. A computer-based image retrieval method for searching images similar to an exemplary image from available images, the image retrieval method being executed by a processor of a computing device and comprising: extracting visual features of the exemplary image and transferring the visual features of the exemplary image to one or more calculating servers, wherein the exemplary image is divided into a plurality of image blocks; distributing image comparison tasks of the available images to the one or more calculating servers, such that the one or more calculating servers calculate similarities of the available images to the exemplary image according to the visual features of the exemplary image and predetermined visual features of the available images, fetch image indexes of the available images from an index server, and returns the similarities and the image indexes of the available images, wherein each of the available images is divided into a same amount of image blocks as the exemplary image, and a similarity of one available image is calculated by calculating a similarity of each image block of the one available image, and calculating an overall similarity of the one available image according to the similarities of all the image blocks of the one available image; gathering the similarities and the image indexes of all the available images from the one or more calculating servers; transferring the gathered similarities of the available images to a sorting server, and causing the sorting server to sort the available images according to the gathered similarities of the available images; and receiving a sorting sequence of the available images from the sorting server, and outputting the image indexes of the available images in the sorting sequence to a client computer.
 19. The image retrieval method of claim 18, wherein the visual features of the exemplary image comprise color features and shape features.
 20. The image retrieval method of claim 19, wherein an arithmetic mean of gray values of pixels in each image block is calculated as color features of the image block, a ratio of black pixels to total pixels in the image block is calculated as shape features of the image block, and overall visual features of the exemplary image is determined according to the color features and the shape features of the image blocks.
 21. The image retrieval method of claim 18, wherein an arithmetic mean of the similarities of all the image blocks of the available image is calculated as the overall similarity of the available image to the exemplary image.
 22. The image retrieval method of claim 18, wherein the similarity of an image block of the available image is determined according to a difference between the image block of the available image and a corresponding image block of the exemplary image.
 23. The image retrieval method of claim 18, wherein the image indexes of the available images comprise image locations, metadata, and visual features of the available images.
 24. The image retrieval method of claim 23, wherein the metadata of the available images comprise captions, keywords, and/or descriptions of the available images.
 25. The image retrieval method of claim 18, wherein the exemplary image and the available images are in portable document format (PDF), joint photographic experts group (JPEG), graphic interchange format (GIF), or tagged image file format (TIFF).
 26. The image retrieval method of claim 18, wherein the image comparison tasks are equally allocated to the calculating servers.
 27. The image retrieval method of claim 18, wherein a specified amount of the image indexes of the available images is output.
 28. A computer-based image retrieval method for searching images similar to an exemplary image from available images, the image retrieval method being executed by a processor of a computing device and comprising: extracting visual features of the exemplary image, wherein the visual features of the exemplary image comprise color features, and the color features of the exemplary image are extracted by dividing the exemplary image into a plurality of image blocks, calculating an arithmetic mean of gray values of pixels in each image block as color features of the image block, and determining overall color features of the exemplary image according to the color features of the image blocks; calculating similarities of the available images to the exemplary image according to the visual features of the exemplary image and predetermined visual features of the available images, and fetching image indexes of the available images from an index server; sorting the available images according to the similarities of the available images to obtain a sorting sequence of the available image; and outputting the image indexes of the available images in the sorting sequence to a client computer.
 29. A computer-based image retrieval method for searching images similar to an exemplary image from available images, the image retrieval method being executed by a processor of a computing device and comprising: extracting visual features of the exemplary image, wherein the visual features of the exemplary image comprise shape features, and the shape features of the exemplary image are extracted by dividing the exemplary image into a plurality of image blocks, calculating a ratio of black pixels to total pixels in the image block as shape features of the image block, and determining overall shape features of the exemplary image according to the shape features of the image blocks; calculating similarities of the available images to the exemplary image according to the visual features of the exemplary image and predetermined visual features of the available images, and fetching image indexes of the available images from an index server; sorting the available images according to the similarities of the available images to obtain a sorting sequence of the available image; and outputting the image indexes of the available images in the sorting sequence to a client computer.
 30. A computer-based image retrieval method for searching images similar to an exemplary image from available images, the image retrieval method being executed by a processor of a computing device and comprising: extracting visual features of the exemplary image, wherein the exemplary image is divided into a plurality of image blocks; calculating similarities of the available images to the exemplary image according to the visual features of the exemplary image and predetermined visual features of the available images, and fetching image indexes of the available images from an index server, wherein each of the available images is divided into a same amount of image blocks as the exemplary image, and a similarity of one available image is calculated by calculating a similarity of each image block of the one available image, and calculating an overall similarity of the one available image according to the similarities of all the image blocks of the one available image; sorting the available images according to the similarities of the available images to obtain a sorting sequence of the available image; and outputting the image indexes of the available images in the sorting sequence to a client computer.
 31. The image retrieval method of claim 30, wherein the visual features of the exemplary image comprise color features and shape features.
 32. The image retrieval method of claim 31, wherein an arithmetic mean of gray values of pixels in each image block is calculated as color features of the image block, a ratio of black pixels to total pixels in the image block is calculated as shape features of the image block, and overall visual features of the exemplary image is determined according to the color features and the shape features of the image blocks.
 33. The image retrieval method of claim 30, wherein an arithmetic mean of the similarities of all the image blocks of the available image is calculated as the overall similarity of the available image to the exemplary image.
 34. The image retrieval method of claim 30, wherein the similarity of an image block of the available image is determined according to a difference between the image block of the available image and a corresponding image block of the exemplary image.
 35. The image retrieval method of claim 30, wherein the image indexes of the available images comprise image locations, metadata, and visual features of the available images.
 36. The image retrieval method of claim 30, wherein the metadata of the available images comprise captions, keywords, and/or descriptions of the available images.
 37. The image retrieval method of claim 30, wherein the exemplary image and the available images are in portable document format (PDF), joint photographic experts group (JPEG), graphic interchange format (GIF), or tagged image file format (TIFF).
 38. The image retrieval method of claim 30, wherein a specified amount of the image indexes of the available images is output. 